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1 Microarchitecture 

Note 



This Data Sheet describes Digital's Alpha 21066 and Alpha 21066A 
microprocessors. Except where the differences are detailed specifically, 
what is described for the 21066 holds true for the 21066A. 



The 21066 microprocessor implements Digital's Alpha architecture. The 
following sections provide an overview of the chip's architecture and major 
functional units. Figure 1 is a block diagram of the 21066 microprocessor. 



1 .1 Overview 



The 21066 microprocessor consists of a core central processing unit (CPU), 
a memory controller, and an I/O controller (IOC). The 21066 also contains 
instruction and data caches (I cache and Dcache) and a serial read-only memory 
(SROM) interface. 

The peripheral component interconnect (PCI) IOC is an interface between 
peripheral devices and the CPU and system memory. It is compatible with the 
PCI Local Bus Specification, Revision 2.0. 

The memory controller interfaces to the system memory and an optional, 
external, backup cache (Bcache). It also contains the embedded graphics 
accelerator. 

The SROM interface provides the initialization data load path from the SROM 
to the Icache. Following initialization, this interface can be converted for use 
as a diagnostic port through the use of privileged architecture library code 
(PAL code). 

The interface unit connects the CPU, the memory controller, the IOC, and the 
SROM interface. It consists of a 64-bit bidirectional data bus, an address bus, 
an invalidate address bus, reset logic, and control. 

The instruction fetch and decode unit (IDU) is the CPU's central control unit. 
It issues instructions, maintains the pipeline, and performs program counter 
(PC) calculations. The CPU also contains four independent execution units: 

• Integer execution unit (I EU) 
Load and store unit (LSU) 

• Floating-point unit (FPU) 



Figure 1 21066 Block Diagram 
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• Branch unit 

Each unit accepts no more than one instruction per cycle; however, correctly 
scheduled code can issue two instructions to two independent units in a single 
cycle. 



1.2 Instruction Fetch and Decode Unit 

The primary function of the I DU is to issue instructions to the I EU, the LSU, 
and the FPU. The I DU contains: 

Pref etcher 

PC pipeline 

Two instruction translation buffers (ITBs) 

Abort logic 

Register conflict or dirty logic 

Exception logic 

Internal processor registers (IPRs) 

Instruction Fetch and Decode 

The IDU decodes two instructions in parallel and checks that the required 
resources are available for both instructions, as follows: 

• If resources are available, both instructions are issued. 

• If resources are available only for the second instruction, neither 
instruction is issued. 

• If the IDU issues only the first of a pair of instructions, it does not advance 
another instruction to attempt another dual issue; dual issue is attempted 
only on aligned quadword (8-byte) pairs. 

Branch Prediction 

The branch unit, or prediction logic, is also part of the I DU . The microprocessor 
offers a register-selectable choice of branch prediction strategies. Each 
instruction location in the instruction cache (Icache) includes a single history 
bit to record the outcome of branch instructions. This information can be used 
to predict the result when the branch instruction is next executed. 

The 21066A supports an improved branch prediction scheme that uses a 
2K x 2-bit history table. The table is indexed by the same bits that index 
the Icache. Each 2-bit table entry behaves as a counter that increments 
on branches taken (stopping at 11 2 ) and decrements on branches not-taken 
(stopping at 00). If the upper bit of the counter is set, the branch is predicted 
taken. The contents of the table are not disturbed by Icache fills. The 21066A 
also supports a static branch-prediction mode that uses the sign bit of the 
branch displacement (as in the 21066). 



Translation Buffers 

The IDU includes two fully associative ITBs: 

• An 8-entry, small-page ITB for 8-KB pages. 

• A 4-entry large-page ITB that supports 4-M B (512 x 8 KB) pages. 

Both translation buffers store recently used, instruction stream (I stream) 
page table entries (PTEs) and use a not-last-used replacement algorithm. In 
addition, both ITBs support a register-enabled extension called the superpage. 
The ITB superpage mappings provide one-to-one virtual PC <33:13> to physical 
PC <33:13> translation when virtual address bits <42:41> = 2. 

Interrupts 

The I DU exception logic supports three sources of interrupts: 

• Hardware interrupts 

- There are three level-sensitive hardware interrupts sourced by pins 
irq<2:0>. 

- There are two internally generated interrupts that respond to external 
interface error conditions. These are sourced by registers in the 
memory controller and the IOC. 

• Software interrupts 

There are 15 prioritized software interrupts, sourced by an onchip register. 

• Asynchronous system traps (ASTs) 

There are four ASTs, one for each processor mode: user, supervisor, 
executive, and kernel. These traps are sourced by an onchip register. 

The interrupt mechanism provides a flexible, software-controlled priority 
scheme that can be implemented by PALcode or by the operating system. 
All interrupts can be independently masked in onchip enable registers. In 
addition, AST interrupts are qualified by the current processor mode. 

Performance Monitoring 

An onchip performance recording mechanism counts various hardware events 
and causes an interrupt upon counter overflow. Two counters are provided to 
allow accurate comparison of two variables under potentially nonrepeatable, 
experimental conditions. The events counted include: 

• Instruction issues 
Nonissues 

• Total cycles 



• Pipeline dry 

• P i pel i ne freeze 

• Cache misses 

• Counts of various instruction classes 

I n addition, two external interface events, such as direct memory access (DMA) 
transactions or external cache accesses, can be counted by programming a 
memory controller register. 

1.3 Integer Execution Unit 

The integer execution unit (IEU) contains the 64-bit integer execution data 
path, which includes the following: 

Adder 

Logic box 

Barrel shifter 

Bytezapper 

Bypassers 

Integer multiplier 

The I E U also contai ns the 32-entry, 64-bit i nteger register file (I RF ). The I RF 
has four read ports and two write ports to simultaneously read operands to 
and write operands (results) from the integer execution data path and the load 
and store unit (LSU). 

1.4 Load and Store Unit 

The LSU contains four major sections: 

• Address translation data path, which includes the data translation buffer 
(DTB) 

Load silo 

• Write buffer 

• Internal processor registers (IPRs) 

Address Translation Data Path 

The address translation data path has a displacement adder that generates 
the effective virtual address for load and store instructions, and a DTB that 
generates the corresponding physical address. 



Data Translation Buffer 

The 32-entry, fully associative DTB stores recently used, data stream (Dstream) 
page table entries (PTEs). The DTB supports four page-size granularity options 
(also called granularity hints) that allow an aligned group of 1, 8, 64, or 512 
pages to be treated as a single larger page. 

The DTB also supports the register-enabled superpage extension. The DTB 
superpage mappings provide virtual-to-physical address translation for two 
regions of the virtual address space: 

• The first region enables superpage mapping when virtual address (VA) bits 
<42:41> = 2. In this mode, the entire physical address space is mapped 
multiple times to the quadrant of virtual address space defined by VA 
<42:41> = 2. 

• The second region maps a 30-bit region of the total physical address 
space, defined by physical address (PA) bits <33:30> = 0, into a single 
corresponding region of virtual address space defined by VA <42:30> = 
1FFE. 

Load Silos 

The LSU contains a memory reference pipeline that can accept a new load 
or store instruction every cycle until a Dcachefill is required. Instructions 
are issued in pipeline stage 3, and the result of each Dcache lookup is not 
known until pipeline stage 6. Therefore, there can be two instructions in the 
LSU pipeline behind a load instruction that misses in the Dcache. These two 
instructions are handled as follows: 

• Loads that hit in the Dcache are allowed to complete (hit-under-miss). 

• Loads that miss are placed in a silo and are replayed in sequence after the 
first load miss completes. 

• Store instructions are presented to the Dcache at their normal time, with 
respect to the pipeline. They are placed in a silo and presented to the write 
buffer in sequence, with respect to loads that miss. 

Write Buffer 

The LSU write buffer has two purposes: 

• The 21066 CPU can generate store data faster than the backup cache 
(Bcache) subsystem can accept the data. This can cause CPU stall cycles. 
The write buffer provides a finite, high-bandwidth resource for receiving 
store data to minimize the number of possible CPU stall cycles. 



• The write buffer also attempts to aggregate store data into aligned, 32-byte 
cache blocks to maximize the rate at which the 21066 can write data into 
the Bcache. 

The 21066A implements revised write buffer unload logic, removing the rare 
possibility that write operations may be buffered indefinitely. 

1.5 Floating-Point Unit 

Theonchip, pipelined floating-point unit (FPU) can execute both IEEE and 
VAX floating-point instructions. The 21066 supports IEEE S_floating and 
T_floating data types, with all rounding modes (except round to ± infinity, 
which can be provided in software). The 21066 fully supports VAX F_floating 
and G_floating data types, and provides limited support for the VAX D_floating 
format. 

The FPU contains: 

• A 32-entry, 64-bit floating-point register file (FRF) 

• A user-accessible control register 

The FPU can accept an instruction every cycle, with the exception of 
floating-point divide instructions. The latency for data-dependent, nondivide 
instructions is six cycles. 

The 21066 supports the IEEE floating-point operations as defined by the Alpha 
architecture. Support for a complete implementation of the IEEE Standard for 
Binary Floating-Point Arithmetic (AN SI /IEEE Standard 754-1985) is provided 
by a combination of hardware and software. The 21066A includes new 
floating-point divide hardware that implements a nonrestoring, normalizing, 
variable-shift (maximum of 4 bits per cycle) algorithm that retires an average 
of 2.4 bits per cycle. The average overall divide latency, including pipeline 
overhead, is 29 cycles for double precision and 19 cycles for single precision 
(compared to 63 and 34 cycles, respectively, in previous implementations). 

Additionally, to avoid the noncompliant (IEEE) divide behavior of previous 
implementations, the new divider calculates the inexact flag, setting the 
inexact (INE) bit in the floating-point control register (FPCR) if appropriate, 
and trapping on DIVx/SI instructions only when the result is really inexact. 1 

The inexact trap disable (I NED) bit has also been added to the FPCR. 



See the Alpha Architecture Reference M anual for more information about the FPCR. 



1.6 Pipeline Organization 

The 21066 has a 7-stage pipeline for integer operate and memory reference 
instructions, and a 10-stage pipeline for floating-point operate instructions. 
The IDU maintains state for all pipeline stages, to track outstanding register 
write operations and to determine I cache hits and misses. 

Figure 2 shows the integer operate, memory reference, and floating-point 
operate pipelines for the IDU, IEU, LSU, and FPU. The first four stages of 
all the pipelines are the same, and are executed in the IDU. The last stages 
are unit specific. All of the units have bypassers that allow the results of one 
instruction to be the operand of a following instruction, without writing the 
results of the first instruction to a register file. 

Figure 2 Instruction Pipelines 
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1.7 Internal Cache Organization 

The 21066 includes two onchip caches— a Dcache and an I cache. All memory 
cells in both caches are fully static, 6-transistor, CMOS structures. 

Data Cache 

The 8-KB Dcache is a write-through, direct-mapped, read-allocate, physical 
cache with 32-byte blocks. When a PCI device writes to cacheable memory, the 
Dcache block corresponding to the memory address is set invalid. The 21066A 
maintains longword cache parity on the Dcache. 

Instruction Cache 

The 8-KB I cache is a physical direct-mapped cache. Each I cache block (line) 
contains: 

I stream data (32 bytes) 

Associated tag (21 bits) 

Address space number (ASN) field (6 bits) 

Address space match (ASM ) field (1 bit) 

Branch history (BHT) field (8 bits in 21066, 16 bits in 21066A) 

The I cache does not contain hardware for maintaining coherency with memory, 
and it is unaffected by PCI write operations to memory. The 21066A maintains 
longword cache parity on the I cache. 

1.8 Memory Controller 

The onchip memory controller interfaces the CPU to the system memory and 
the optional backup cache (Bcache). It has several memory-mapped control 
and status registers (CSRs) to program organization, timing, and the size 
of DRAM, VRAM, and Bcache SRAM. It controls CPU requests and DMA 
requests (from the IOC) to and from memory and the Bcache. It also controls 
VRAM shift-register loads and memory refresh operations. 

The memory controller decodes the address of a CPU request to determine 
whether the request is for memory or the IOC. It handles the access to the 
memory controller CSRs, the memory, and the Bcache. If the request is 
directed at the IOC, the memory controller passes control to the IOC. 

The memory controller can also perform the following graphics operations: 

• Dumb frame buffer operation 

• Transparent stipple operation 

• Write-per-bit plane masking 



• Byte write operations (with external gating) 

• Full and split VRAM shift-register load instructions 

1.9 I/O Controller 

The onchip I/O controller (IOC) is an interface bridge between peripheral 
devices and the CPU and system memory. The IOC interface protocol complies 
with the PCI Local Bus Specification, Revision 2.0. All peripheral devices in 
a 21066-based system can communicate with the CPU and system memory 
through the IOC. Peripheral chips that are PCI compliant can be connected 
directly to the 21066 without any glue logic. The IOC runs asynchronously to 
the CPU, using the PCI clock input. 

The IOC incorporates scatter-gather mapping logic to translate 32-bit 
addresses generated by PCI bus masters to the 34-bit CPU physical address 
space. The IOC implements an 8-entry translation lookaside buffer (TLB) for 
fast translations. Two programmable address windows control PCI peripheral 
device access to system memory. 

1.10 Obtaining Additional Information 

To obtain more information about the 21066 and 21066A microprocessors, the 
Alpha architecture and instruction set, and the PCI, see the Technical Support 
and Ordering I nformation section at the end of this manual. 
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2 Pinout 

Sections 2.1 through 2.3 list the external signals and their associated pins, 
describe the external signals, and list the signals according to function. 

2.1 Signal List 

Table 1 lists the signal associated with each pin. 

Table 1 Signal List 



Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


— 


(Index point) 


B01 


Vdd 


C01 


ad 17 


A02 


Vss 


B02 


ad 12 


C02 


ad 14 


A03 


Vdd 


B03 


ad 11 


C03 


pci elk in 


A04 


Vss 


B04 


ad9 


C04 


ad 10 


A05 


gnt 1 


B05 


par 


C05 


ad8 


A06 


Vdd 


B06 


perrl 


C06 


reql 


A07 


Vss 


B07 


devsel 1 


C07 


framel 


A 08 


lock 1 


B08 


trdy 1 


C08 


stopj 


A09 


Vdd 


B09 


c be 13 


C09 


irdy 1 


A 10 


Vss 


BIO 


c be 10 


CIO 


c be 11 


All 


ad5 


Bll 


ad6 


Cll 


ad7 


A 12 


Vdd 


B12 


ad4 


C12 


ad3 


A13 


Vss 


B13 


ad2 


C13 


adl 


A 14 


adO 


B14 


bc_cs_l 


C14 


bcoej 


A15 


Vdd 


B15 


bc_tag7 


C15 


be tag6 


A16 


Vss 


B16 


be tag5 


C16 


bc_tag3 


A17 


be tag4 


B17 


bc_tag2 


C17 


bedirty 


A18 


Vdd 


B18 


bctagO 


C18 


bcidxtagO 


A 19 


Vss 


B19 


beparity 


C19 


bc_idx_tag2 


A20 


be idx tagl 


B20 


bc_idx_tag3 


C20 


bc_idx_tag4 


A21 


Vdd 


B21 


memaddrO 


C21 


mem addrl 


A22 


Vss 


B22 


mem addr3 


C22 


Vdd 



(continued on next page) 
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Table 1 (Cont.) Signal List 



Pin 


Signal 


Pin 


Signal 


1 


Pin 


Signal 


D01 


Vss 


E01 


Vdd 




H19 


Vdd 


D02 


ad 16 


E02 


ad 18 




H20 


memwroel 


D03 


instr ref 


E03 


ad 15 




H21 


memwritel 


D04 


ad 13 


E04 


Vss 




H22 


mem rd oe 


D05 


Vdd 


E19 


Vdd 




J 01 


ad29 


D06 


Vss 


E20 


mem 


addr5 


J 02 


ad28 


D07 


rst 1 


E21 


mem 


addr6 


J 03 


ad27 


D08 


Vdd 


E22 


mem 


addr9 


J 04 


Vdd 


D09 


Vss 


F01 


ad23 




J 19 


Vss 


D10 


c be 12 


F02 


ad21 




J 20 


mem dataO 


Dll 


Vdd 


F03 


ad 19 




J 21 


mem datal 


D12 


Vss 


F04 


Vdd 




J 22 


Vdd 


D13 


be we 1 


F19 


Vss 




K01 


Vss 


D14 


Vdd 


F20 


mem 


addr7 


K02 


mem data32 


D15 


Vss 


F21 


mem 


addrlO 


K03 


ad31 


D16 


be tagl 


F22 


Vdd " 




K04 


ad30 


D17 


Vdd 


G01 


Vss 




K19 


mem data2 


D18 


Vss 


G02 


ad24 




K20 


mem data3 


D19 


be index 


G03 


ad22 




K21 


mem data4 


D20 


mem addr2 


G04 


ad20 




K22 


Vss " 


D21 


mem addr4 


G19 


mem 


addr8 


L01 


Vdd 


D22 


Vss " 


G20 


mem 


addrll 


L02 


mem data34 






G21 


mem 


casj 


L03 


mem data33 






G22 


Vss " 




L04 


Vss " 






HOI 


Vdd 




L19 


Vdd 






H02 


ad26 




L20 


mem data5 






H03 


ad25 




L21 


mem data6 






H04 


Vss 




L22 


mem data7 



(continued on next page) 
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Table 1 (Cont.) Signal List 



Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


M01 


mem data35 


R19 


Vss 


W01 


Vss 


M02 


mem data36 


R20 


mem data 17 


W02 


mem data53 


M03 


mem data37 


R21 


mem data 16 


W03 


mem data57 


M04 


Vdd 


R22 


Vdd 


W04 


mem data60 


M19 


Vss 


T01 


Vss 


W05 


Vss " 


M20 


mem data9 


T02 


mem data46 


W06 


Vdd 


M21 


mem data8 


T03 


mem data47 


W07 


pll filter 


M22 


Vdd 


T04 


mem data51 


W08 


Vss 


N01 


Vss 


T19 


mem data22 


W09 


Vdd 


N02 


mem data38 


T20 


mem data20 


W10 


irql 


N03 


mem data39 


T21 


mem data 18 


Wll 


Vss 


N04 


mem data40 


T22 


Vss " 


W12 


Vdd 


N19 


mem data 12 


U01 


Vdd 


W13 


mem dsf 


N20 


mem data 11 


U02 


mem data48 


W14 


Vss " 


N21 


mem datalO 


U03 


mem data50 


W15 


Vdd 


N22 


Vss 


U04 


Vss " 


W16 


mem eccl 


P01 


Vdd 


U19 


Vdd 


W17 


Vss " 


P02 


mem data41 


U20 


mem data23 


W18 


Vdd 


P03 


mem data42 


U21 


mem data21 


W19 


mem data31 


P04 


Vss 


U22 


mem data 19 


W20 


mem data29 


P19 


Vdd 


V01 


mem data49 


W21 


mem data25 


P20 


mem data 15 


V02 


mem data52 


W22 


Vss " 


P21 


mem data 14 


V03 


mem data54 






P22 


mem data 13 


V04 


Vdd 






R01 


mem data43 


V19 


Vss 






R02 


mem data44 


V20 


mem data26 






R03 


mem data45 


V21 


mem data24 






R04 


Vdd 


V22 


Vdd 







(continued on next page) 
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Table 1 (Cont.) Signal List 



Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Y01 


Vdd 


AA01 


mem data55 


AB01 


Vss 


Y02 


mem data56 


AA02 


mem data58 


AB02 


Vdd 


Y03 


mem data59 


AA03 


mem data61 


AB03 


mem data63 


Y04 


mem data62 


AA04 


Vss " 


AB04 


Vss " 


Y05 


pll elk in 


AA05 


pll i ref 


AB05 


Vdd 


Y06 


pll_clk_in_l 


AA06 


pll 5v 


AB06 


reset in 1 


Y07 


pll bypass 


AA07 


Vss 


AB07 


Vss 


Y08 


testclkout 


AA08 


sromd 


AB08 


Vdd 


Y09 


sromoe I 


AA09 


sromclk 


AB09 


irq2 


Y10 


irqO 


AA10 


memreq I 


AB10 


Vss 


Yll 


trst 1 


A A 11 


memack 1 


ABU 


Vdd 


Y12 


tdi 


AA12 


tms 


AB12 


tck 


Y13 


vrefreshl 


AA13 


memdtoel 


AB13 


Vss 


Y14 


mem_rasa_l3 


A A 14 


vframe_l 


ABM 


Vdd 


Y15 


memrasaJO 


AA15 


mem rasa 12 


AB15 


tdo 


Y16 


mem rasb 12 


AA16 


mem rasa 11 


AB16 


Vss 


Y17 


memeccO 


AA17 


mem rasb 11 


AB17 


Vdd 


Y18 


mem ecc2 


AA18 


memrasblO 


AB18 


mem rasb 13 


Y19 


mem_ecc5 


A A 19 


mem_ecc3 


AB19 


Vss " 


Y20 


mem_ecc7 


AA20 


mem_ecc4 


AB20 


Vdd 


Y21 


mem data28 


AA21 


mem data30 


AB21 


mem ecc6 


Y22 


mem data27 


AA22 


Vdd 


AB22 


Vss 
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2.2 Signal Descriptions 

Table 2 describes the function of each external signal in alphabetical order. 

Table 2 Signal Description 



Signal 



Type Description 



ad<31:0> 



be cs I 



bedirty 



bc_idx_tag<4:0> 



be index 





bcoel 





bc_parity 


I/O 


bc_tag<7:0> 


I/O 



bcwel 
c be l<3:0> 



I/O Multiplexed PCI address and data. The byte address is driven 
during the first clock of a PCI transaction, and data is driven 
during subsequent clock cycles. 

O BcacheSRAM chip select. The SRAM s are enabled only when this 

signal is asserted. This signal provides a power-saving feature 
when the Bcache is not being accessed. 

I/O I ndicates the status of the data stored in a cache block. A value of 
indicates that the cache and memory contain the same data; a 
value of 1 indicates that the cache contains more recently written 
data than memory (that is, memory data is stale). It has the same 
timing characteristics as bc_tag<7:0>. 

I/O An index bit or a tag bit, depending on the Bcache size. When the 
signal is an index bit, it is an output. When the signal is a tag bit, 
signal direction depends on cycle type— during a cache fill or write 
operation, the signal is an output; during a cache lookup, the signal 
is an input. 

Transmits Bcache index bit 12 to the cache. 

Bcache output enable signal. 

Transmits the Bcache tag parity to the tag array during cache fill 
and write operations, and from the tag array during cache lookups. 
It has the same timing characteristics as bc_tag<7:0>. 

Represent the upper part of the physical address stored in a cache 
block. During a Bcache lookup operation, the cache drives these 
bits (and possibly bc_idx_tag<4:0>) with the tag that corresponds 
to the current index being driven on mem addr<ll:0>, be index, 
and the nontag bits of bc_idx_tag<4:0>. During a Bcache fill or 
write operation, the memory controller drives the most significant 
address bits on these lines, to be stored in the Bcache tag array. 

O Bcache write-enable signal. 

I/O Multiplexed PCI bus command codes and byte enables. The 
command code is driven during the address cycle of a PCI 
transaction, and inverted byte enables are driven during data 
cycles. 

(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



devsel I 
framej 

gntj 
instr ref 



irdyl 



irq<2:0> 



lock I 



memack I 



I/O Asserted by the device that is addressed by the current PCI 
transaction. 

I/O Asserted at the beginning of a PCI transaction. It also controls the 
number of data transfers during the transaction (burst length). It 
is deasserted during the final data phase of a transaction. 

] Asserted by external arbitration logic when the CPU is granted 

ownership of the PCI . The arbiter is expected to park (default 
grant) the PCI to the CPU when no other device is requesting 
ownership. 

During normal operation, this signal indicates the type of current 
access (instruction or data) on the memory data bus. When high, 
this signal indicates an I stream reference; when low, this signal 
indicates a Dstream reference. 

During reset, the internal PCI clock is output on this pin for test 
purposes. 

I/O Asserted by the initiator of a PCI transaction to indicate that it 
can complete the current data phase of a PCI transaction. During 
a read cycle, this signal is asserted to indicate that the initiator 
is ready to accept read data. During a write cycle, this signal is 
asserted to indicate that the initiator is driving valid write data 
onto ad<31:0>. The current data phase completes when both 
trdyj and irdyj are asserted when sampled. 

1 External interrupt requests. 

During reset, these pins are part of the clock system and are 
tristated to receive the power-up values (through external resistors 
tied to Vss or Vdd) that set the frequency ratio between the 
internal clock and external reference clock. 

] I implements atomic (exclusive) access operations on the PCI . 

I n test mode, this signal is used to test the 24-bit retry timeout 
counter at the chip tester. This test mode can be selected by 
programming the J TAG instruction register (IR) (IR bit 3 =0). 
During test mode, when this signal is high, it selects the lower 
half of the counter for testing; when this signal is low it selects the 
upper half of the counter for testing. 

O This PCI sideband signal is synchronous with pci_clk_in. The 

IOC asserts this signal when it has won arbitration (internal to 
the 21066) for access to memory requested by a memreq I. This 
signal remains asserted until memreq I is deasserted. 

(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



mem addr<Ll:0> 



mem cas I 



mem data<63:0> 



mem dsf 



Transmit the row and column address to memory and the 12 

least significant index bits to the Bcache. For memory read and 
write operations (that is, not refresh operations), signals mem_ 
addr<ll:0> contain a valid row address when mem_ras_l* is 
asserted, and contain a valid column address when mem_cas_l is 
asserted. 

O When asserted during memory read and write operations, this 

signal indicates that mem addr<ll:0>, mem data<63:0> (for 
write operations), and mem_write_l contain valid information. 

During a memory refresh cycle, this signal is asserted before 
memrasj* is asserted. 

I/O Transmit data between the memory controller and either the 
memory or the Bcache. 

Memory (or an optional transceiver, see mem_rd_oe and 
mem_wr_oe_l) drives these signals during a read operation from 
memory, when mem_cas_l and mem_rd_oe are asserted and 
mem_write_l is not asserted. 

The Bcache drives these signals during a read operation from the 
Bcache, when bc_oe_l is asserted. 

The memory controller drives these signals during a write 
operation. Data is valid when either bc_we_l or mem_cas_l, 
memwroej, and memwrite I are asserted. 

During a write-per-bit operation, the memory controller drives 
these signals with the write-per-bit mask. Data is valid when 
memrasj*, memwroej, and memwritej are asserted. 

O Selects a full or split VRAM shift-register load function. This 

special function signal is valid before mem_ras_l* is asserted 
when mem dtoe I is asserted. 



*The term mem_ras_l represents the mem_rasa_l<3:0> and mem_rasb_l<3:0> signals for the selected bank. 

(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



mem dtoe I 



mem ecc<7:0> 



memrasaj <3:0>, 
mem rasb l<3:0> 



Controls the memory output enable function or the VRAM shift- 

register load function. 

For normal memory operations, this multifunction signal is 
deasserted when mem_ras_l* is asserted. During a read cycle, 
this signal is asserted before mem_cas_l is asserted to enable the 
memory data output drivers. 

To indicate a VRAM shift-register load sequence, this signal is 
asserted before mem_ras_l is asserted. The value of mem_dsf 
determines whether a full or split VRAM shift-register load 
sequence is performed. 

I/O Transmit the error correction codes (ECC) between the memory 
controller and either the memory or the Bcache. The memory 
controller generates ECC for write operations and checks it on 
read operations. These signals have the same external timing as 
mem data<63:0>. (Memory storage for ECC is optional, and ECC 
checking can be disabled using the bank configuration registers.) 

These signals also transmit a write byte mask for banks that have 
byte write enabled in the bank configuration register (external 
logic is required to gate the DRAM write signals). When used 
this way, memeccO corresponds to mem_data<7:0>, memeccl 
corresponds to mem_data<3.5:8>, and so on. 

O Each pair of these signals is associated with a bank of memory. 

During normal read and write operations, the assertion of mem_ 
rasj* indicates that mem_addr<ll:0> contains a valid row 
address. Which RAS is asserted depends on all of the following: 

• Which memory bank is addressed 

Whether split bank is enabled in the bank configuration 
register 

• A row address bit 

During a memory refresh cycle, if the refresh enable bit is set, all 
eight of these signals are asserted together. Refresh cycles are the 
CAS-before-RAS type. 

During VRAM shift-register load functions, mem_rasa_l/i and 
mem rasb l/i for bank n are asserted. 



*The term mem_ras_l represents the mem_rasa_l<3:0>and mem_rasb_l<3:0> signals for the selected bank. 

(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



mem_rd_oe O Enables an optional, external, memory transceiver to drive 

data from the memory parts onto mem data<63:0> and 
mem_ecc<7:0>. It is asserted during read cycles when mem_cas_l 
is asserted. This signal should be ignored if a transceiver is not 
used. 

memreq I I This PCI sideband signal is synchronous with pci_clk_in. When 

the IOC samples this signal asserted, it arbitrates (internal to the 
21066) for access to memory. When arbitration to memory has been 
won, the IOC asserts memack I. 

mem_write_l O Provides read and write control for memory and enables loading 

of the write-per-bit mask. For write operations, mem_write_l 
is asserted before mem_cas_l is asserted; for read operations, 
mem_write_l is deasserted before mem_cas_l is asserted. The 
write-per-bit function is activated when mem_write_l is asserted 
before memrasj* . 

mem_wr_oe_l O Enables an optional, external, memory transceiver to drive data 

from mem data<63:0> and mem_ecc<7:0> to the memory parts. 
It is asserted during write cycles when mem_data<63:0> is driven. 
This pin should be ignored if a transceiver is not used. 

par I/O This PCI signal is the even parity bit for ad<31:0> and 

c_be_l<3:0>. 

pci_clk_in I Provides timing for all transactions on the PCI. All of the IOC's 

PCI signals except rst_l are synchronous with this signal. Inputs 
are sampled on, and outputs change state as a result of the rising 
edge of this signal. 

perrj I/O This PCI signal is asserted when a data parity error has been 

detected. 

pi I bypass I Asserted when the external clock input pll_clk_in directly drives 

the internal logic, and the internal clock and external reference 
frequencies are equal. 

pll_clk_in, I For normal operation, a low-speed (less than 50 MHz), single-ended 

pll_clk_in_l clock and an appropriate reference or bias voltage are supplied to 

pllclkin and pllclkinl, respectively. 

To minimize jitter induced by module and package noise, a high- 
speed (greater than 50 MHz) differential reference clock (logically 
complementary, nominal square waves) is supplied to pll_clk_in 
and pllclkinl. 

*The term mem_ras_l represents the mem_rasa_l<3:0> and mem_rasb_l<3:0> signals for the selected bank. 

(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



pi I filter 

pll_i_ref 
pll_5v 

reql 
reset in I 



rstj 

sromclk 
sromd 

sromoe I 

stopl 

tck 
tdi 



I A capacitor connected between this signal and Vss maintains 

stable operation by setting the correct feedback-loop time constant. 
The time constant regulates the speed with which the phased- 
locked loop (PLL) responds to changes in frequency or operating 
conditions. 

] The constant current flowing in a resistor connected between this 

signal and Vss is the reference for analog PLL circuits. It reduces 
the variations in the speed of CMOS devices over a wide range of 
process and operating conditions. 

I This clean +5-V signal is regulated internally to source the 

nominal +3.3 V used by the PLL and associated logic. This on- 
chip isolation is necessary to reduce phase jitter. Connect 5-V 
decoupling capacitors as close as possible to this pin and the Vss 
pins. 

Asserted by the CPU when it needs to initiate a PCI transfer. 
External arbitration logic is required. 

1 Master reset input for the 21066; should be asserted when power is 
first applied to the chip. When it is asserted, certain internal chip 
logic is immediately initialized (some internal state is not reset and 
must be handled by software when the chip boots). I nternal chip 
activity starts 31 cycles after reset_in_l is negated in synchronism 
with the internal clock. 

O PCI reset signal generated by the CPU . The RST bit in the I OC 

PCI soft reset register allows this signal to be asserted under 
software control. This signal is automatically asserted when 
reset_in_l is asserted. 

SROM clock signal when sromoe I is asserted. When sromoe I is 
not asserted, this is software-controlled serial port output data. 

1 SROM data when sromoe I is asserted. When sromoe I is not 
asserted, this is software-controlled serial port input data. 

Asserted after reset_in_l is asserted, and enables the SROM for 
initialization. Following initialization, this signal is deasserted, 
enabling the SROM port to be used as a software-controlled serial 
port. 

I/O The target of a PCI transaction drives this signal to request that 
the initiator stop the current transaction. 

1 J TAG boundary scan clock. 

I J TAG serial boundary scan data-in signal. 

(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



tdo 

test elk out 



O 
O 



tms 
trdyj 



I/O 



trst I 



vframe I 



J TAG serial boundary scan data-out signal. 

An output reference clock to be used only for testing the 21066. 
Its rising edge into a 40-pF load nominally coincides with the start 
of an internal microcycle. The relationship between this signal 
and pll_clk_in can be determined following the second negation of 
reset_in_l after power is turned on or the clock frequency ratio is 
changed. This pin should not be used to drive module-level logic. 



test_clk_out is the internal 
1, test elk out imitates 



For the 21066A, when pi I bypass = 0, 
clock divided by 4; when pllbypass = 
the internal clock. 

J TAG test mode select signal. 

Asserted by the target of a PCI transaction to indicate that it can 
complete the current data phase of a PCI transaction. During 
a read cycle, this signal is asserted to indicate that the selected 
device is driving valid data onto ad<31:0>. During a write cycle, 
this signal is asserted to indicate that the selected device is ready 
to accept write data. The current data phase completes when both 
trdyj and irdyj are asserted when sampled. 

J TAG test access port (TAP) reset signal. This signal must be 
asserted during power-up, to select standard SROM initialization 
of Icache. It may be left continuously asserted if no other J TAG 
functions need to be exercised. 

When this signal is asserted, the memory controller uses the video 
and graphics control register fields to: 

1. Reload the video display pointer with the start-of-video-frame 
value. 

2. Perform a full VRAM shift-register load cycle to the bank 
selected by the start-of-video-frame value. 

3. I ncrement the video display pointer twice, as specified by the 
address increment value. 



(continued on next page) 
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Table 2 (Cont.) Signal Description 



Signal 



Type Description 



vrefresh I 



I When this signal is asserted, the memory controller uses the video 

and graphics control register fields to: 

1. Perform a split VRAM shift-register load cycle to the bank 
selected by the start-of-video-frame value. 

2. I ncrement the video display pointer once as specified by the 
address increment value. 
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2.3 Quick Reference to Signals by Function and Direction 

Table 3 provides a quick reference to the signals, grouped by function. 

Table 3 Signals by Function 



Name 



Qty Type Purpose 



Value at Reset 



Memory Controller Signals 



bc_cs_l 


1 





bc_dirty 


1 


I/O 


bc_idx_tag<4:0> 


5 


I/O 


be index 


1 





bc_oe_l 


1 





bc_parity 


1 


I/O 


bc_tag<7:0> 


8 


I/O 


bcwel 


1 





mem addr<Ll:0> 


12 





memcasl 


1 





mem data<63:0> 


64 


I/O 


memdsf 


1 





memdtoel 


1 





mem ecc<7:0> 


8 


I/O 



mem_rasa_l<3:0> 8 
mem rasb l<3:0> 



O 



Bcache chip select 

Bcache valid 

Bcache index or tag 

Bcache index (bit 12) 

Bcache output enable 

Bcache tag parity 

Bcache tag 

Bcache write-enable 

Row/column address, Bcache 
index 

Column address strobe 

M emory/B cache data 

Disable special function 

Data transfer/output enable 

Memory/Bcache error correction 
code 

Row address strobes 



memrdoe 


1 





Memory read transceiver output 
enable 


memwritel 


1 





Write-enable 


memwroel 


1 





Memory write transceiver output 
enable 


vframej 


1 


1 


Load video display pointer and 
load VRAM shift register 


vrefresh 1 


1 


1 


1 ncrement video display pointer 



and load VRAM shift register 



Driven, asserted 

Tristate 

Tristate 

Driven, UNDEFINED 

Driven, asserted 

Tristate 

Tristate 

Driven, deasserted 

Driven, UNDEFINED 

Driven, deasserted 

Tristate 

Driven, deasserted 

Driven, deasserted 

Tristate 

Driven, deasserted 

Driven, deasserted 

Driven, deasserted 
Driven, asserted 

NA* 

NA* 



*NA =not applicable 



(continued on next page) 
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Table 3 (Cont.) Signals by Function 



Name 



Qty Type Purpose 



Value at Reset 



PCI Signals 



ad<31:0> 

c_be_l<3:0> 

devsel I 

framej 

gntj 

irdyl 

lock_l 

memack I 

memreq I 

par 

pci_clk_in 

perrl 

reql 

rst 

stopl 

trdyj 



32 I/O PCI multiplexed address and 

data bus 



I/O PCI multiplexed cycle command 
and byte enables 



Tristate when gntj is 
deasserted; otherwise, 
UNDEFINED 

Tristate when gntj is 
deasserted; otherwise, 
UNDEFINED 



1 


I/O 


PCI device select 


Tristate 


1 


I/O 


PCI cycle frame 


Tristate 


1 


1 


PCI bus grant 


NA* 


1 


I/O 


PCI initiator ready 


Tristate 


1 


1 


PCI lock 


NA* 


1 





Grant for IOC access to 21066 
memory 


Tristate 


1 


1 


Request for IOC access to 21066 
memory 


NA* 


1 


I/O 


PCI even parity bit 


Tristate ' 
deassert( 
UNDEFI 


1 


1 


PCI clock input 


NA* 


1 


I/O 


PCI parity error 


Tristate 


1 





PCI bus request 


Tristate 


1 





PCI reset 


Asserted 


1 


I/O 


PCI target stop 


Tristate 


1 


I/O 


PCI target ready 


Tristate 



Clock Signals 



pi I bypass 
pll_clk_in 
pll_clk_in_l 
pll_i_ref 



PLL bypass select 
PLL clock input 
PLL clock input low 
PLL reference current 



NA* 
NA* 
NA* 
NA* 



*NA = not applicable 



(continued on next page) 
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Table 3 (Cont.) Signals by Function 



Name 


Qty 


Type 


Purpose 


Value at Reset 


Clock Signals 


pi 1 filter 


1 


1 


PLL low-pass filter capacitor 


NA* 


pll_5v 


1 


1 


PLL voltage supply 


NA* 


resetinl 


1 


1 


Master reset input 


NA* 


testclkout 


1 





Output clock 


Driven, clocking 


JTAG Signals 


tck 


1 


1 


J TAG boundary scan clock 


NA* 


tdi 


1 


1 


J TAG serial boundary scan data 
in 


NA* 


tdo 


1 





J TAG serial boundary scan data 
out 


Determined by the 
state of the J TAG 
controller 


tms 


1 


1 


J TAG test mode select 


NA* 


trstj 


1 


1 


J TAG TAP reset 


NA* 


Interrupt, SROM Interface, 


and instr 


_ref Signals 




instrref 


1 





1 stream or Dstream reference 


If lockj is deasserted 
and the mode is 
PCI_SYNC_MODE, 
this pin is driven with 
pci_clk_in; otherwise, 
this pin is driven with 
the chip internal clock 


irq<2:0> 


3 


1 


External interrupt request 


NA* 


sromclk 


1 





SROM clock or transmit serial 
data 


Driven high 


sromd 


1 


1 


SROM data or receive serial 
data 


NA* 


sromoej 


1 





SROM output enable 


Driven, deasserted 


*NA =not applicable 



Table 4 provides a quick reference to the signals, grouped by direction. 



25 



Table 4 Signals by Direction 





Active 




Active 




Active 


Signal 


Level 


Signal 


Level 


Signal 


Level 


Input Signals 


gntj 


Low 


pll_clk_in_l 


Low 


tck 


High 


irq<2:0> 


High 


pi 1 filter 


High 


tdi 


High 


lockj 


Low 


pll_i_ref 


High 


tms 


High 


memreqj 


Low 


pll_5v 


High 


trstj 


Low 


pci_clk_in 


High 


resetinl 


Low 


vframej 


Low 


pllbypass 


High 


sromd 


High 


vrefreshl 


Low 


pll_clk_in 


High 










Output Signals 


bc_cs_l 


Low 


memcasj 


Low 


memwroel 


Low 


bcindex 


High 


memdsf 


High 


reql 


Low 


bcoel 


Low 


memdtoel 


Low 


rstj 


Low 


bcwel 


Low 


memr asal <3:0> 


Low 


sromclk 


High 


instrref 


High 


memrasbl <3:0> 


Low 


sromoe 1 


Low 


memackl 


Low 


mem_rd_oe 


High 


tdo 


High 


mem add r <U:0> 


High 


mem_write_l 


Low 


testclkout 


High 


I/O Signals 


ad<31:0> 


High 


c_be_l<3:0> 


Low 


mem_ecc<7:0> 


High 


bcdirty 


High 


devsel 1 


Low 


par 


High 


bc_idx_tag<4:0> 


High 


frame_l 


Low 


perrl 


Low 


bcparity 


High 


irdyl 


Low 


stop_l 


Low 


bc_tag<7:0> 


High 


mem data<63:0> 


High 


trdy_l 


Low 
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3 Electrical Specifications 

This section specifies: 

• PCI electrical conformance 

• Absolute maximum ratings 

• Supply current and power dissipation 

• Chip power supply sequencing 

• dc and ac specifications 

3.1 PCI Electrical Specification Conformance 

The 21066 IOC PCI pins conform to the basic set of PCI electrical specifications 
in the PCI Local Bus Specification, Revision 2.0, including: 

• Standard signaling 

Logic levels follow standard TTL thresholds to accommodate PCI drivers 
and receivers implemented with existing CMOS and TTL devices and 
processes. 

• 33-10 support 

The 21066 supports a 33-MHz interconnection of up to 10 PCI devices. 

3.2 Absolute Maximum Ratings 

Table 5 lists the absolute maximum ratings for the 21066. These are stress 
ratings only; extended exposure to the maximum ratings might affect the 
reliability of the device. 

Caution 



Although the 21066 incorporates protective circuitry to resist damage 
from static electric discharge, Digital recommends avoiding high-static 
voltages or electric fields. 
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Table 5 Absolute Maximum Ratings 



Parameter 



Minimum 



Maximum 



Storage temperature range 


-55°C 


+125°C 


Active temperature range (case) 


0°C 


t 


Supply voltage Vdd 


-0.5 V 


3.6 V 


Supply voltage Vcc (pll_5v) 


-0.5 V 


5.5 V 


ESD protection voltage 


NA* 


1500.0 V 


Overshoot (5-V-safe pins) 


NA* 


See Notes for Table 6 


Overshoot (5-V-nonsafe pins) 


NA* 


See Notes for Table 6 


U ndershoot 


NA* 


-1.0 V 



*NA =not applicable 

tSee Table 15 in Section 5 for maximum case temperatures. 



3.3 Supply Current and Power Dissipation 

The supply current and power dissipation are as follows: 







Microprocessor 






Parameter 


21066A-266 


21066A-233 


21066-166 




21066A-100 


Idd 

Power 


7.4 A 

25 W (maximum) 


6.6 A 

23 W (maximum) 


6.1 A 

21 W (maximi 


im) 


3.5 A 

10 W (maximum) 



Test Conditions 

The supply current and power dissipation test conditions are as follows: 



Parameter 



Condition 



Package temperature with heat sink 

Vdd 

pll_5v 

Clock frequency 



See Table 15 in Section 5 for T c (maximum). 

3.465 V 

5.250 V 

266 M H z/233.33 M H z/166.67 M H z/100 MHz 
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3.4 Chip Power Supply Sequencing 

The Vdd (3.3-V) and pll_5v (5-V) supply voltages should ramp up and ramp 
down simultaneously, but the two ramps need not be perfectly aligned. As 
shown in the following relationship, the rule is that the pll_5v supply must 
never exceed the value of the Vdd supply by more than 3.6 V; that is, Vdd 
cannot be less than ground or more than 3.465 V. 

Vdd > (pll_5v - 3.6 V) for V < Vdd < 3.465 V 

This tells us that, when the 5-V supply is 3.6 V or less, the 3.3-V supply can be 
zero. But after the 5-V supply exceeds 3.6 V, the 3.3-V supply must match the 
rise in the 5-V supply, volt for volt. For example, when the 5-V supply reaches 
4.5 V, the 3.3-V supply must be 0.9 V or more (4.5 - 3.6 = 0.9). 

The ramp rates of the two supplies are not part of the equation and only the 
difference in voltages need be considered. However, power supplies with long 
ramp rates (several tens of milliseconds or longer) should be avoided because 
such slow ramp rates are likely to cause excessive die heating. 

If the 3.3-V supply ramps up before the 5-V supply, there are no voltage- 
differential restrictions and the value of the 3.3-V supply can lead the value of 
the 5-V supply by any amount. However, because die power dissipation is high 
in the absence of clocks, and the phase-locked loop (PLL) that generates the 
clocks runs on 5 V, timing is restricted. 

On power-up, if the Vdd supply leads the pll_5v supply, the pll_5v supply 
must reach 4.5 V no more than 1 second after the Vdd supply has reached 2 V 
Generally, there is no problem during power-down provided that the clocks are 
stopped for no longer than 1 second while the 3.3-V supply remains applied. 
This power-down timing restriction can be satisfied by ensuring that the value 
of the Vdd supply will be 2 V or less within 1 second after the value of the 
pll_5v supply is less than 3 V 

Because the rules for the Vdd supply leading the pll_5v supply are more 
difficult to implement, Digital recommends that the pll_5v supply be applied 
and removed before the Vdd supply according to the guidelines in this section. 

The pll_5v pin must be connected directly to a 5-V supply if either the 5-V PCI 
clamps or the PLL are used. 
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3.5 dc Specifications 

Table 6 lists the pin characteristics. 

Table 6 Pin Characteristics 









Internal* 








Internal* 








Pull-Up or 








Pull-Up or 


Signals 


Type 


Notes 


Pull-Down 


Signals 


Type 


Notes 


Pull-Down 


mem data <63:0> 


I/O 


1 


Pull -down 


perrl 


I/O 


1 


— 


mem_ecc<7:0> 


I/O 


1 


Pull -down 


devsel I 


I/O 


1 


— 


mem add r <U:0> 





2 


— 


req It 





2 


— 


memwritel 





2 


Pull-up 


gnt 1 




1 


— 


memrasaj <3:0> 





2 


Pull-up 


rst 1 





2 


— 


memrasbj <3:0> 





2 


Pull-up 


lockj 




1 


— 


memcasj 





2 


Pull-up 


pciclkin 




1 


— 


memdtoel 





2 


Pull-up 


memreq 1 




1 


— 


memdsf 





2 


Pull-up 


memack 1 





2 


— 


mem_rd_oe 





2 


Pull -down 


pll elk in 




1 


— 


memwroel 





2 


Pull-up 


pll_clk_in_l 




1 


— 


bcoej 





2 


— 


pll bypass 




— 


— 


bc_tag<7:0> 


I/O 


1 


Pull -down 


pll filter 




— 


— 


bcparity 


I/O 


1 


Pull -down 


pll_i_ref 




— 


— 


be index 





2 


— 


pllirefret 




— 


— 


be idx tag<4:0> 


I/O 


1 


Pull -down 


testclkout 





2 


— 


bc_cs_l 





2 


Pull-up 


reset in 1 




1 


— 


bcwel 





2 


— 


tdi 




1 


Pull-up 


bedirty 


I/O 


1 


Pull -down 


tdo 





2 


— 


vframel 


1 


1 


— 


tms 




1 


Pull-up 


vrefresh 1 


1 


1 


— 


tck 




1 


Pull-down 


ad<31:0> 


I/O 


1 


— 


trstj 




1 


Pull-up 


c_be_l<3:0> 


I/O 


1 


— 


irq<2:0> 




1 


— 


framej 


I/O 


1 


— 


sromoe I 





2 


— 


trdyj 


I/O 


1 


— 


sromd 


1 


1 


— 


irdyl 


I/O 


1 


— 


sromclk 





2 


— 


stopl 


I/O 


1 


— 


instrref 





2 


— 


par 


I/O 


1 


— 


— 


— 


— 


— 


*l nternal pull-up and 


pull -down 


are on di 


iring reset and. 


TAG operations. 








tThe req_l pin is tristated durir 


g chip or 


PCI reset. 
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Notes for Table 6: 

1 The I/O and input-only pins are 5-V-safe. This means that as long as Vdd 
is 3.3 V ±5%, these pins can be safely exposed to voltages up to 5.7 V 
indefinitely. I n addition, overshoots up to 6.4 V are allowed for up to 5% of 
the duty cycle, 11 ns per pulse. 

The I/O and input-only pins are not 5-V-safe when Vdd is less than 
3.135 V. U nder these conditions, the pins can be exposed to voltages up to 
the greater of 3.6 V or Vdd + 2.6 V for any length of time. Overshoots up 
to the greater of 4.3 V or Vdd + 3.3 V are allowed for up to 5% of the duty 
cycle, 11 ns per pulse. These overshoot limits do not apply to the PCI pins 
unless Vdd <3.1 V and pll_5v >4.75 V. 

2 The output-only pins are not 5-V-safe. This means that when driving, 
these pins can be exposed to voltages up to 3.6 V indefinitely, but cannot be 
exposed to higher voltages. Therefore, if returned reflections are present, 
they must be limited to 3.6 V; and if test equipment is used to overdrive 
an output-only pin, the test equipment must not expose the pin to voltages 
greater than 3.6 V 
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3.5.1 dc Operating Specifications 

Table 7 lists the functional operating dc parameters for the 21066, and Table 8 
lists them for the 21066A. The functional operating range is as follows: 

Vdd = 3.3 V ±5% 

Tease =0°C toT c (maximum) (package temperature with heat sink) except as 

noted. (See Table 15 in Section 5 for maximum case temperatures.) 

Note 



I n Tables 7 and 8, currents into the chip (chip sinking) are denoted 
as positive (+) current. Currents from the chip (chip sourcing) are 
denoted as negative (- ) current. 



Table 7 21066 dc Parameters 



Symbol Parameter 



Min* Max* 



Unit Comments 



Vil Low-level input voltage — 

Vih High-level input voltage 2.0 

Vol Low-level output voltage — 

Von High-level output voltage 2.4 

lil Input leakage current — 

I hi Input leakage current — 

III Input leakage current — 

loz Tri state leakage current — 



Cin Input capacitance 

Co I/O or output-only pin 

capacitance 



0.8 


V 


— 


— 


V 


— 


0.4 


V 


1 ol = 6 mA 


— 


V 


loh =-2 mA 


+30 


M A 


For pins without internal pull- 
up or pull-down @0.4 V or 
2.7 V 


+175 


uA 


For pins with internal pull- 
down Vih =Vdd 


-260 


uA 


For pins with internal pull-up 
Vil =Vss 


±70 

+195 

-195 


M A 
M A 
uA 


Pins without pull-up or pull- 
down @0.4 V or 2.7 V 
Pins with pull-down 
@2.7 V 
Pins with pull-up @0.4 V 


6 


PF 


Frequency =1 MHz, by design 


14 


PF 


Frequency =1 MHz, by design 



*Min = minimum, Max = maximum 



(continued on next page) 
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Table 7 (Cont.) 21066 dc Parameters 



Symbol 


Parameter 


Min* 


Max* 


Unit 


Comments 


Cclk 


CLK capacitance: 
pll elk in 
pll_clk_in_l 
pciclkin 


— 


10 
10 
16 


PF 
PF 
PF 


Frequency =1 MHz, by design 
Frequency =1 MHz, by design 
Frequency =1 MHz, by design 


Viclk 


Differential voltage: 
pll elk in 
pll_clk_in_l 


±0.6 
±3.6 


— 


V 
V 


Vcclk =1.2-V nominal clock 
differential center voltage 


Vbclk 


Inactive clock bias 
voltage: 
pll elk in 
pll_clk_in_l 


1.2 
1.2 


— 


V 
V 


For single-ended clock 
operation 


VOS 


Externally driven pin 
voltage 




Vdd +2.6 


V 


Also applies to PCI pins if 
Vdd <3.1V and 
pll_5v > 4.75 V 


Vos 


Externally driven pin 
voltage 




Vdd + 3.3 


V 


5% duty cycle, 11 ns maximum 
pulse width, also applies to 
PCI pins if Vdd < 3.1V and 
pll_5v > 4.75 V 



Vclamp PCI pin clamp voltaget 



6.8 



V @lih=64mA 

Vdd = 3.3 V - 5% 
pll_5v =5.0V +5% 



*Min = minimum, Max = maximum 

tPCI pin clamps protect only the 21066 and not other devices on the bus. 



Table 8 21066A dc Parameters 



Symbol Parameter 



Min* 



Max* 



Unit Comments 



Vil 


Low-level input voltage 


— 


0.8 


V 


— 


Vih 


High-level input voltage 


2.0 


— 


V 


— 


Vol 


Low-level output voltage 


— 


0.4 


V 


I ol = 6 mA 


Voh 


High-level output voltage 


2.4 


— 


V 


I oh =-2 mA 


Ml 


Input leakage current 


— 


±30 


M A 


For pins with 



up or pull-down @0.4 V or 
2.7 V 



*Min = minimum, Max = maximum 



(continued on next page) 
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Table 8 (Cont.) 21066A dc Parameters 



Symbol 


Parameter 


Min* 


Max* 


Unit 


Comments 


Ihl 


I nput leakage current 


+20 


+175 


uA 


For pins with internal pull- 
down Vih =2.7V 


III 


I nput leakage current 


-20 


-230 


uA 


For pins with internal pull-up 
Vil =0.4V 


loz 


Tristate leakage current 


— 


±70 

+215 
-270 


uA 
M A 
uA 


Pins without pull-up or pull- 
down @0.4 V or 2.7 V 
Pins with pull-down 
@2.7 V 
Pins with pull-up @0.4 V 


Cin 


Input capacitance 


— 


6 


PF 


Frequency =1 MHz, by design 


Co 


I/O or output-only pin 
capacitance 


— 


14 


PF 


Frequency =1 MHz, by design 


Cclk 


CLK capacitance: 
pll elk in 
pll_clk_in_l 
pci_clk_in 


— 


10 
10 
16 


PF 
PF 
PF 


F requency = 1 M H z, by design 
Frequency =1 MHz, by design 
Frequency =1 MHz, by design 


Viclk 


Differential voltage: 
pll elk in 
pll_clk_in_l 


±0.6 
±0.6 


— 


V 
V 


Vcclk =1.2-V nominal clock 
differential center voltage 


Vbclk 


Inactive clock bias 
voltage: 
pll elk in 
pll_clk_in_l 


1.2 

1.2 


— 


V 
V 


For single-ended clock 
operation 


VOS 


Externally driven pin 
voltage 


— 


Vdd +2.6 


V 


pllclockin, pllclockinl 

Applies to all other pins except 



Vos Externally driven pin 

voltage 



Vclamp PCI pin clamp voltaget 



Vdd +3.3 



7.35 



testel kout, pi Ifi Iter , 
pll_i_ref if Vdd < 3.1 V and 
pll_5v>4.75 V 

V 5% duty cycle, 11 ns 
maximum pulse width, 
applies to all other pins except 
testclkout, pll filter, 
pll_i_ref if Vdd < 3.1 V and 
pll_5v>4.75 V 

V lih =69mA, Vdd =3.3V -5% 
pll_5v=5.0V +5% 



*Min =minimum, Max =maximum 

tPCI pin clamps protect only the21066A and not other devices on the bus. 
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3.6 ac Specifications 

The ac specifications consist of input requirements and output responses. The 
input requirements are rise and fall times, pulse widths, and setup and hold 
times. Output responses are delays from clock to signal. 

Test Conditions 

The test conditions for the ac parameters specified in this section (except in 
Table 12) are as follows: 



Parameter 



Condition 



Package temperature with heat sink 

Vss 

Vdd 

Cload 



OV 

3.3 V ±5% 

50 pF unless otherwise specified 



*See Table 15 in Section 5 for maximum case temperatures. 



Figure 3 defines the ac parameter measurements. 
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Figure 3 ac Timing Measurement 

Rise 



Rise Time 
Fall Time 
Pulse Width 



Setup and 
Hold Time 




Output Delay 



Valid to 

High Impedance 



High Impedance 
to Valid 
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Table 9 lists the clock and reset pin ac parameters. 



Table 9 Clock and Reset ac Parameters 



Symbol Parameter/Signals 



Min* 



Max* 



Unit 



Notes 



Tif Internal clock 29 

frequency 

Tfreq Frequency 

pll_clk t 
pll_clk +• 
pci_clk 16.67 



266/233.33/ 
166.67/100 
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Tcyc 


Clock cycle time 
pll elkt 
pll_clk $ 
pciclk 


30 


60 


Thigh 


Clock high time 
pll elkt 
pll_clk $ 
pciclk 


0.4xTcyc 

2.5 

0.4xTcyc 


— 


Tlow 


Clock low time 

pll elkt 
pll_clk $ 
pciclk 


0.4xTcyc 

2.5 

0.4xTcyc 


— 


Tr 


Clock rise time 

pll clkf 
pll_clk +- 
pciclk 


— 


3.0 
0.8 
3.0 


Tf 


Clock fall time 

pll elkt 
pll_clk $ 
pciclk 


— 


3.0 
0.8 
3.0 


Trst 


Reset pulse width 
reset in 1 

rst(PCI) 


10 x Tcyc 
1 


— 


Trss 


Setup time 

resetinl 


5 


— 


Trst 
elk 


Clock active time to 
end of rst 


100 


— 


Tel kout 


Delay time 

test elk out 




25 



MHz 



MHz 



ns 



ns 



ns 



ns 



ns 



ms 
ns 

ns 
Ms 
ns 



1 
1 
2 

1 
1 
2 

>2.0V 



<0.8V, 
by design 



0.8 V to 2.0 V, 
by design 



2.0 V to 0.8 V, 
by design 



* Min =minimum : Max =maximum. 
t Bypass =0 
t Bypass = 1 
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Notes for Table 9: 

1 pllclk refers to both pllclkin and pll_clk_in_l, the differential clock 
inputs to the chip. The maximum internal clock frequency (Tif) is 100 MHz 
for the 21066A-100, 166.67 M Hz for the 21066-166, 233.33 M Hz for the 
21066A-233, and 266 MHz for the 21066A-266. The internal high- 
frequency clock is a function of the programmed multiplier value on 
the irq<^:0> pins during reset. The pllclk maximum frequency must 
be chosen such that the maximum internal clock frequency (Tif) is not 
exceeded. When pllbypass is asserted, the internal clock frequency 
equals the external clock frequency. When pllbypass is deasserted, the 
typical internal clock frequencies are as follows: 



pll_clk_in 


(MHz) = 


16.67 


25.00 


33.33 


Multiplier 


Values 




Internal Clock Frequency (MHz) 


9 




150.00t 


225.00** 


300. 00*** ♦ 


8 




133. 33t 


200.00** 


266.67t** 


7 




116. 67t 


175.00** 


233. 33** 


6 




100.00 


150.00t 


200.001* 


5 




83.33 


125.00t 


166. 67t 


4 




66.67 


100.00 


133. 33t 


3 




50.00 


75.00 


100.00 


2 




33.33 


50.00 


66.67 



The maximum internal frequency (Tif) limits the range of multiplier values for a given 
pll_clk_in frequency. Although all these multiplier values are available, they should not be 
used at this pll_clk_in frequency for the following: 

* 21066A-233 
t 21066A-100 

* 21066-166 

* 21066A-266 

2 The PCI clock input is asynchronous to the internal CPU clock. For correct 
operation, the PCI clock frequency must be less than or equal to the 
internal CPU clock frequency. 

3 Trst typical =100 ms. 

4 Signal reset_in_l is an asynchronous input. The setup time is only for the 
tester. 

5 Clock must run for 100 us before the deassertion of rst. 
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6 Signal test_clk_out delay is measured with respect to the rising edge of 
pllclkin. Guaranteed by design. 

To maintain stable operation, a 0.01-uF capacitor is connected between the 
pllfilter pin and Vss. It sets the feedback-loop time constant needed to 
regulate the speed with which the PLL responds to changes in frequency or 
operating conditions. Additionally, a 5.1-kf? resistor is connected between the 
pll_i_ref pin and Vss. The constant current that flows in the resistor provides 
the reference to the analog PLL circuits. 
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Table 10 lists the memory controller pin ac parameters. The parameters are 
relative to the core clock. 



Table 10 


Memory Controller ac Parameters 








Symbol 


Parameter 


Signals 


Minimum 
ns 


Maximum 
ns 


Notes 


Tsvl 


Setup time 


mem data <63:0> 
mem_ecc<7:0> 


1.50 


— 


1 


Thl 


Hold time 


mem data <63:0> 
mem_ecc<7:0> 


0.50 


— 


1 


Tdl 


Valid delay time 


mem data <63:0> 
mem_ecc<7:0> 


— 


4.50 


2 


Tvzl 


Valid to high-Z delay time 


mem data <63:0> 
mem_ecc<7:0> 


— 


4.00 


3 


Tzvl 


High-Z to valid delay time 


mem data <63:0> 
mem_ecc<7:0> 


1.50 


— 


2 


Tsv2 


Setup time 


bc_tag<7:0> 
bcparity 
bc_idx_tag<4:0> 
bcdirty 


2.00 




1 


Th2 


Hold time 


bc_tag<7:0> 
bcparity 
bc_idx_tag<4:0> 
bcdirty 


0.00 




1 


Td2 


Valid delay time 


bc_tag<7:0> 

bcparity 

bc_idx_tag<4:0> 

bcdirty 

mem addr<ll:0> 

bcjndex 




4.25 


2 


Tvz2 


Valid to high-Z delay time 


bc_tag<7:0> 

bcparity 

bc_idx_tag<4:0> 

bcdirty 

bcjndex 




3.75 


3 


Tzv2 


High-Z to valid delay time 


bc_tag<7:0> 

bcparity 

bc_idx_tag<4:0> 

bcdirty 

be index 


1.50 




2 



(continued on next page) 
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Table 10 (Cont.) Memory Controller ac Parameters 



Symbol 


Parameter 


Signals 


Minimum 
ns 


Maximum 
ns 


Notes 


Td3 


Valid delay time 


memcas I 

memrdoe 

memwroel 


— 


4.00 


2 


Td4 


Valid delay time 


memr asal <3:0> 
memrasbj <3:0> 
memdtoej 
memdsf 
memwritel 




3.75 


2 


Td5 


Valid delay time 


bc_we I 
be oe 1 
bc_cs_l 




4.50 


2 


Tpwl 


Pulse width low time 


vrefreshl 
vframe_l 


10.00 


85.00 


— 


Tpwh 


Pulse width high 


vrefreshl 
vframej 


1000.00 


— 


— 


Tno 


Time by which the assertion 
of vrefresh I and vframej 

must be separated 


vrefreshl 
vframej 


1000.00 







Notes for Table 10: 

1 Setup and hold times are measured with respect to the rising edge of the 
test_clk_out pin. Thetest_clk_out pin, when loaded with a lumped 40-pF 
load, imitates the internal clock. For the 21066A, when pllbypass =0, 
test_clk_out is the internal clock divided by 4; when pllbypass =1, 
test_clk_out imitates the internal clock. The testclkout pin is intended 
for test purposes only. 

2 The drive times assume a lumped, 40-pF load and are measured with 
respect to the test_clk_out pin. For the 21066A, pllbypass = 1. 

3 The pin is defined to be in tristate when a 2-mA current source changes the 
output voltage by 50 mV. The pin is assumed to be connected to a lumped 
40-pF load for this test. 

Table 11 lists the I/O controller pin ac parameters. The parameters are relative 
to the PCI clock signal pci_clkjn. 
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Table 11 IOC Pin ac Parameters 



Symbol Parameter 



Signals 



Minimum Maximum 



Tval 



Clock to signal valid 
delay time 



Tival 



Clock to signal invalid 
delay time 



Ton 



High-Z to active 
delay time 



Toff 



Active to high-Z 
delay time 



ad<31:0> 

c_be_l<3:0> 

frame 

trdyj 

irdyj 

stop I 

par 

perr I 

devsel I 

lockj 

ad<31:0> 

c_be_l<3:0> 

frame 

trdyj 

irdyj 

stopl 

par 

perrl 

devsel I 

lock_l 

ad<31:0> 

c_be_l<3:0> 

frame 

trdy_l 

irdyj 

stopl 

par 

perrl 

devsel I 

lockj 

ad<31:0> 
c_be_l<3:0> 
frame 
trdyj 
irdyj 
stop I 
par 
perr I 
devsel I 
lock I 



11 ns* 



2.0 ns 



2.0 ns 



2.0 ns 



28 ns 



•Cload =50pF 



(continued on next page) 
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Table 11 (Cont.) IOC Pin ac Parameters 



Symbol Parameter 



Signals 



Minimum Maximum 



Tsu 


Input signal valid 


ad<31:0> 




setup time 


c_be_l<3:0> 

frame 

trdyj 

irdyl 

stopj 

par 

perrj 

devsel I 

lockj 

memreq I 


Th 


Input signal hold time 


ad<31:0> 

c_be_l<3:0> 

frame 

trdyj 

irdyl 

stopj 

par 

perrj 

devsel I 

lockj 

memreq I 


Tackv 


Valid delay time from 
clock rising edge 


memack I 


Tval-side 


Signal valid delay time 


reqj 


Tsu-side 


Signal valid setup time 


gntj 



7.0 ns 



0.0 ns 



3.6 ns 



12 ns 



15 ns* 
12 ns 



"Cload =50 pF 



If a 5-V signaling environment is used on the PCI bus, the pll_5v pin must 
be connected to a 5-V supply. Table 12 (abridged from the PCI Local Bus 
Specification, Revision 2.0) specifies the ac parameters for 5-V signaling. 
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Table 12 ac Specifications for 5-V Signaling 



Symbol 


Parameter 


Condition 


Minimum 


Maximum 


Unit 


Id 


Low clamp 
current 


-5 <Vin < -1 


-25 +(Vin +1)/ 0.015 


— 


mA 


Tr 


Unloaded 


0.4 V to 2.4 V 


1 


— 


V/ns 




output 
rise time 










Tf 


Unloaded 


2.4 V to 0.4 V 


1 


— 


V/ns 




output 
fall time 
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Table 13 lists the J TAG pin ac parameters. 



Table 13 JTAG Pin ac Parameters 



Symbol 


Parameter 


Signals 


Minimum 


Maximum 


Unit 


Comments 


Tjf 


Frequency 


tck 





10 


MHz 


— 


Tjp 


Period 


tck 


100 


— 


ns 


— 


Tjht 


High time 


tck 


45 


— 


ns 


— 


Tjlt 


Low time 


tck 


45 


— 


ns 


— 


Tjrt 


Rise time 


tck 


— 


10 


ns 


Measured between 0.8 V and 
2.0 V 


Tjft 


Fall time 


tck 


— 


10 


ns 


Measured between 2.0 V and 
0.8 V 


Tjs 


Setup time 


tdi 
tms 


10 


— 


ns 


With respect to tck rising edge 


Tjh 


Hold time 


tdi 
tms 


25 


— 


ns 


With respect to tck rising edge 


Tjd 


Valid delay 


tdo 


— 


30 


ns 


With respect to tck falling edge 
Cload =50 pF 


Tjfd 


Float delay 


tdo 


— 


30 


ns 


With respect to tck falling edge 



Table 14 lists the ac parameters for the miscellaneous pins. The parameters 
specified are for test purposes only and are measured with respect to the 
testclkout signal. 



Table 14 Miscellaneous Pin ac Parameters 



Symbol Parameter Signals Minimum Maximum 



Tmst 



Tmht 



Setup time irq<2:0> 5 ns 
sromd 



Hold time 



irq<2:0> ns 
sromd 
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4 Mechanical Specifications 



Figures 4 and 5 show the 287-pin standard pin grid array (PGA) package and 
its dimensions. 



Figure 4 21 066/21 066A Package— Top and Side 
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Figure 5 21 066/21 066A Package— Bottom 
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5 Thermal Specifications 

Sections 5.1 and 5.2 specify the 21066 operating temperature and thermal 
resistance. 

5.1 Operating Temperature 

The operating temperature (T c ) of the 21066 is measured at the center of the 
heat sink between the two package studs. The 21066 is specified to operate 
within a temperature range from 0°C toT c (maximum), which is based on 
the operating frequency of the chip. Table 15 gives the maximum operating 
temperatures for the 21066. 

Table 15 Maximum T c at Various Frequencies 

T c (Maximum) 



Frequency 21066 21066A 



100 MHz — 93°C 

166 MHz 85°C 

233 MHz — 84°C 

266 MHz — 70°C 



5.2 Thermal Resistance 

The following equations define the heat-sink-to-ambient thermal resistance 
values: 

^hs-a ~ p 

T c = T a + Px e hs - a 

The symbols in the previous equations are defined as follows: 

©hs-a is the heat-sink-to-ambient thermal resistance (°C/W). 

T a is the ambient temperature (°C). 

T c is the case temperature measured at a predefined location on the 

heat sink (°C). 
P is the power dissipation (W). See Section 3.3, which details chip power 

consumption at various frequencies. 
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Table 16 lists the @h s - a values for several heat sinks used with the 21066 
287-pin ceramic PGA. 

Note 



The heat sink greatly improves the ambient temperature requirement 
and Digital recommends its use. 



Table 16 6>^ s _ a at Various Airflows 

h »-a with: 



Airflow (ft/min) 


Heat Sink 1 


Heat Sink 2 


Heat Sink 3 


50 


2.65 


3.70 


7.35 


100 


1.95 


2.80 


6.55 


200 


1.35 


1.85 


5.00 


400 


1.00 


1.30 


3.10 


600 


0.85 


1.10 


2.10 


800 


0.7 


0.9 


1.65 



Heat sink 1 (11 fins): 2.55 x 2.55 x 1.2 in (6.5 x 6.5 x 3.0 cm) 

Heat sink 2 (13 fins): 2.27 x 2.27 x 0.9 in (5.8 x 5.8 x 2.3 cm) 

Heat sink 3 (14 fins): 2.38 x 2.10 x 0.3 in (6.0 x 5.3 x 0.8 cm) 

All heat sinks are unidirectional and made of aluminium alloy 6063. 

TheGRAFOIL pad is the interface material between the package and heat sink. 
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6 Register Summary 



The tables in this section provide a summary of the 21066 implementation- 
specific internal processor registers (IPRs), the memory controller 
registers, and the I/O controller (IOC) registers. For information about 
the architecturally specified IPRs, see the Alpha Architecture Reference 
Manual. 



Table 17 21066-Specific Internal Processor Registers 



Mnemonic 



Register Name 



Field* Index* 



Instruction Fetch and Decode Unit Registers 



ASTER 

ASTRR 

EXC_ADDR 

EXC_SUM 

HIER 

HIRR 

ICCSR 

ITBASM 

ITBIS 

ITB_PTE 

ITB_PTE_TEMP 

ITBZAP 

PAL_BASE 

PS 

SIER 

SIRR 

SL_CLR 

SL_RCV 

SL_XMIT 

TB TAG 



Asynchronous system trap interrupt enable 

Asynchronous system trap request 

Exception address 

Exception summary 

Hardware interrupt enable 

Hardware interrupt request 

Instruction cache control and status 

Instruction translation buffer address space 

match 

Instruction translation buffer initial state 

Instruction translation buffer page table entry 

Instruction translation buffer page table entry 

temporary 

Instruction translation buffer ZAP 

Programmable array logic (PAL) base address 

Processor status 

Software interrupt enable 

Software interrupt request 

Clear serial line interrupt 

Serial line receive 

Serial line transmit 

Translation buffer tag 



IBX 


18 


IBX 


14 


IBX 


4 


IBX 


10 


IBX 


16 


IBX 


12 


IBX 


2 


IBX 


7 


IBX 


8 


IBX 


1 


IBX 


3 


IBX 


6 


IBX 


11 


IBX 


9 


IBX 


17 


IBX 


13 


IBX 


19 


IBX 


5 


IBX 


22 


IBX 






Load and Store Unit Registers 



ABOX_CTL Load and store unit (Abox) control 

ALT_MODE Alternate processor mode 

CC Cycle counter 



ABX 14 
ABX 15 
ABX 16 



*HW_MFPR and HW_MTPR instruction fields: PAL, ABX, IBX, and I ndex (<7,6,5,4:0>). 

(continued on next page) 
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Table 17 (Cont.) 21066-Specific Internal Processor Registers 



Mnemonic 



Register Name 



Field* Index* 



Load and Store Unit Registers 



CC_CTL 

DC_STATt 

C_5TATt 

DTBASM 

DTB_CTL 

DTBIS 

DTBPTE 

DTB_PTE_TEMP 

DTBZAP 

FLUSHJC 

FLUSH_IC_ASM 

MM_CSR 

VA 



Cycle counter control 

Data cache status 

Cache status 

Data translation buffer address space match 

Data translation buffer control 

Data translation buffer invalidate single 

Data translation buffer page table entry 

Data translation buffer page table entry 

temporary 

Data translation buffer ZAP 

Flush instruction cache 

Flush instruction cache address space match 

Memory management control and status 

Virtual address 



ABX 


17 


ABX 


12 


ABX 


12 


ABX 


7 


ABX 





ABX 


8 


ABX 


2 


ABX 


3 


ABX 


6 


ABX 


21 


ABX 


23 


ABX 


4 


ABX 


5 



PAL Temporary Registers 



PAL_TEMP<31:0> PAL_TEMP internal processor 



PAL 31.. 



*HW_MFPR and HW_MTPR instruction fields: PAL, ABX, IBX, and Index (<7,6,5,4:0>). 
tlmplemented in the21066A only. 
^Implemented in the 21066 only. 



Table 18 Memory Controller Registers 



Mnemonic 



Register Name 



Address 
(Hexadecimal) 



BCRO 


Bank configuration 


BCR1 


Bank configuration 1 


BCR2 


Bank configuration 2 


BCR3 


Bank configuration 3 


BMRO 


Bank mask 


BMR1 


Bank mask 1 


BMR2 


Bank mask 2 


BMR3 


Bank mask 3 


BTRO 


Bank timing 



12000 
12000 
12000 
12000 
12000 
12000 
12000 
12000 
12000 



0000 
0008 
0010 
0018 
0020 
0028 
0030 
0038 
0040 



(continued on next page) 
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Table 18 (Cont.) Memory Controller Registers 



Mnemonic 



Register Name 



Address 
(Hexadecimal) 



BTR1 


Bank timing 1 


BTR2 


Bank timing 2 


BTR3 


Bank timing 3 


GTR 


Global timing 


ESR 


Error status 


EAR 


Error address 


CAR 


Cache control 


VGR 


Video and graphics control 


PLM 


Plane mask 


FOR 


Foreground 


PMR* 


Power management register 



2000 
2000 
2000 
2000 
2000 
2000 
2000 
2000 
2000 
2000 
2000 



0048 
0050 
0058 
0060 
0068 
0070 
0078 
0080 
0088 
0090 
0098 



•Implemented in the21066A only. 



Table 19 I/O Controller Registers 



Mnemonic 



Register Name 



Address 
(Hexadecimal) 



IOC_HAE 

IOCCFG 

I OC_STAT0 

IOC_STATl 

IOC_TBIA 

IOC_TB_ENA 

IOC_SFT_RST 

IOC_PAR_DIS 

IOC_W_BASE0 

IOC_W_BASEl 

IOC_W_MASK0 

IOC_W_MASKl 

IOC_T_BASE0 

IOC_T_BASEl 

IOC_TB_TAG0 

IOC_TB_TAGl 

IOC_TB_TAG2 

IOC_TB_TAG3 

IOC TB TAG4 



Host address extension 

Configuration cycle type 

Status 

Status 1 

Translation buffer invalidate all 

Translation buffer enable 

PCI soft reset 

Parity disable 

Window base 

Window base 1 

Window mask 

Window mask 1 

Translated base 

Translated base 1 

Translation buffer tag 

Translation buffer tag 1 

Translation buffer tag 2 

Translation buffer tag 3 

Translation buffer tag 4 



8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8000 
8100 
8100 
8100 
8100 
8100 



0000 
0020 
0040 
0060 
0080 
00A0 
00C0 
00E0 
0100 
0120 
0140 
0160 
0180 
01A0 
0000 
0020 
0040 
0060 
0080 



(continued on next page) 



52 



Table 19 (Cont.) I/O Controller Registers 



Address 

Mnemonic Register Name (Hexadecimal) 

IOC_TB_TAG5 Translation buffer tag 5 1 8100 00A0 

IOC_TB_TAG6 Translation buffer tag 6 1 8100 00C0 

IOC_TB_TAG7 Translation buffer tag 7 1 8100 00E0 
IOC_IACK_SC Interrupt vector and special cycle 

•Any quadword-aligned address in the range 1A 0000000.. 1BFFFFFE0. 
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7 Instruction Summary 

The tables in this section summarize the common instructions implemented 
by the Alpha architecture, the PALmode instructions required by all Alpha 
implementations, and the architecturally reserved PALmode instructions 
implemented in the 21066 microprocessor. The instruction summaries are 
contained in the following tables: 

Instructions Table 

Memory integer load and store 20 

I nteger control 21 

Integer arithmetic 22 

Logical and shift 23 

Byte manipulation 24 

M emory format floati ng-poi nt 25 

Floating-point branch 26 

Floating-point operate 27 

Miscellaneous 28 

VAX compatibility instructions 29 

Required PALmode 30 

Architecturally reserved PALmode 31 



Table 20 Memory Integer Load and Store Instructions 
Mnemonic Operation 



LDA 


Load address 


LDAH 


Load address high 


LDL 


Load sign-extended longword 


LDL L 


Load sign-extended longword 


LDQ 


Load quadword 


LDQ L 


Load quadword locked 


LDQJJ 


Load quadword unaligned 


STL 


Store longword 


STL C 


Store longword conditional 


STQ 


Store quadword 


STQ C 


Store quadword conditional 


STQ U 


Store quadword unaligned 



locked 
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Table 21 Integer Control Instructions 



Mnemonic 



Operation 



BEQ 

BGE 

BGT 

BLBC 

BLBS 

BLE 

BLT 

BNE 



Branch 
Branch 
Branch 
Branch 
Branch 
Branch 
Branch 
Branch 



f register equal to zero 

f register greater than or equal to zero 

f register greater than zero 

f register low bit is clear 

f register low bit is set 

f register less than or equal to zero 

f register less than zero 

f register not equal to zero 



BR 
BSR 



Unconditional branch 
Branch to subroutine 



J MP 
J SR 
RET 
J SR_COROUTINE 



J ump 

J ump to subroutine 
Return from subroutine 
J ump to subroutine return 



Table 22 Integer Arithmetic Instructions 



Mnemonic 



Operation 



ADD 

S4ADD 

S8ADD 



Add quadword/longword 
Scaled add by 4 
Scaled add by 8 



CMPEQ 
CMPLT 
CMPLE 



Compare signed quadword equal 
Compare signed quadword less than 
Compare signed quadword less than or equal 



CMPULT 
CMPULE 



Compare unsigned quadword less than 
Compare unsigned quadword less than or equal 



MUL 
UMULH 



Multiply quadword/longword 
Multiply quadword unsigned high 



SUB 

S4SUB 

S8SUB 



Subtract quadword/longword 
Scaled subtract by 4 
Scaled subtract by 8 
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Table 23 Logical and Shift Instructions 



Mnemonic 



Operation 



AND 


Logical product 


BIC 


Logical product with complement 


BIS 


Logical sum (OR) 


EQV 


Logical equivalence (XORNOT) 


ORNOT 


Logical sum with complement 


XOR 


Logical difference 


CMOVxx 


Conditional move integer 


SLL 


Shift left logical 


SRA 


Shift right arithmetic 


SRL 


Shift right logical 



Table 24 Byte-Manipulation Instructions 



Mnemonic 



Operation 



CMPBGE 


Compare byte 


EXTBL 


Extract byte low 


EXTWL 


Extract word low 


EXTLL 


Extract longword low 


EXTQL 


Extract quadword low 


EXTWH 


Extract word high 


EXTLH 


Extract longword high 


EXTQH 


Extract quadword high 


INSBL 


Insert byte low 


INSWL 


Insert word low 


INSLL 


Insert longword low 


INSQL 


Insert quadword low 


INSWH 


Insert word high 


INSLH 


Insert longword high 


INSQH 


Insert quadword high 


MSKBL 


Mask byte low 


MSKWL 


Mask word low 


MSKLL 


Mask longword low 


MSKQL 


Mask quadword low 



(continued on next page) 
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Table 24 (Cont.) Byte-Manipulation Instructions 



Mnemonic 



Operation 



MSKWH 

MSKLH 

MSKQH 

ZAP 
ZAP NOT 



Mask word high 
Mask longword high 
Mask quadword high 

Zero bytes 
Zero bytes not 



Table 25 Memory Format Floating-Point Instructions 



Mnemonic 



Operation 



Subset 



LDF 
LDG 
LDS 
LDT 

STF 
STG 
STS 
STT 



Load F_floating VAX 

Load G_floating (load D_floating) VAX 

Load S_floating (load longword integer) IEEE and VAX 

Load T_floating (load quadword integer) IEEE and VAX 

Store F_floating VAX 

Store G_floating (store D_floating) VAX 

Store S_floating (store longword integer) IEEE and VAX 

Store T_floating (store quadword integer) IEEE and VAX 



Table 26 Floating-Point Branch Instructions 



Mnemonic 



Operation 



Subset 



FBEQ 
FBGE 
FBGT 
FBLE 
FBLT 
FBNE 



Floating branch equal lEEEandVAX 

Floating branch greater than or equal IEEE and VAX 

Floating branch greater than IEEE and VAX 

Floating branch less than or equal IEEE and VAX 

Floating branch less than lEEEandVAX 

Floating branch not equal IEEE and VAX 
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Table 27 Floating-Point Operate Instructions 



Mnemonic 



Operation 



Subset 



Arithmetic Operations 



ADDF 
ADDG 
ADDS 
ADDT 

CMPGxx 
CMPTxx 

CVTDG 
CVTGD 
CVTGF 
CVTGQ 
CVTQF 
CVTQG 
CVTQS 
CVTQT 
CVTST 
CVTTQ 
CVTTS 

DIVF 
DIVG 
DIVS 
DIVT 

MULF 
MULG 
MULS 
MULT 

SUBF 
SUBG 
SUBS 
SUBT 



Add F_floating 
Add G_floating 
Add S_floating 
Add T_floating 

Compare G_floating 
Compare T_floati ng 



Convert 
Convert 
Convert 
Convert 
Convert 
Convert 
Convert 
Convert 
Convert 
Convert 
Convert 



D_floating 
G_floating 
G_floating 
G_floating 
quadword 
quadword 
quadword 
quadword 
S_floating 
T_floating 
T_floating 



toG_floating 
to D_floating 
to F_floating 
to quadword 
to F_floating 
toG_floating 
to S_floating 
toT_floating 
toT_floating 
to quadword 
to S_floating 



Divide F_floating 
Divide G_floating 
Divide S_floating 
Divide T_floating 

Multiply F_floating 
Multiply G_floating 
Multiply S_floating 
Multiply T_floating 

Subtract F_floating 
Subtract G_floating 
Subtract S_floating 
Subtract T_floating 



VAX 
VAX 

IEEE 
IEEE 

VAX 

IEEE 

VAX 

VAX 

VAX 

VAX 

VAX 

VAX 

IEEE 

IEEE 

IEEE 

IEEE 

IEEE 

VAX 
VAX 

IEEE 
IEEE 

VAX 
VAX 

IEEE 
IEEE 

VAX 
VAX 

IEEE 
IEEE 

(continued on next page) 
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Table 27 (Cont.) Floating-Point Operate Instructions 



Mnemonic 



Operation 



Subset 



Bit and FPCR Operations 



CPYS 

CPYSE 

CPYSN 

CVTLQ 
CVTQL 



Copy sign 

Copy sign and exponent 

Copy sign negate 

Convert longword to quadword 
Convert quadword to longword 



I E E E and VAX 
I E E E and VAX 
I E E E and VAX 

I E E E and VAX 
IEEE and VAX 



FCMOVxx 

MF_FPCR 
MT FPCR 



Floating conditional move IEEE and VAX 

Move from floating-point control register IEEE and VAX 

Move to floating-point control register IEEE and VAX 



Table 28 Miscellaneous Instructions 



Mnemonic 


Operation 


CALL PAL 


Call privileged architecture library routine 


EXCB 


Exception barrier 


FETCH 


Prefetch data 


FETCH M 


Prefetch data, modify intent 


MB 


Memory barrier 


RPCC 


Read process cycle counter 


TRAPB 


Trap barrier 


WMB 


Write memory barrier 



Table 29 VAX Compatibility Instructions 



Mnemonic 



Operation 



RC 
RS 



Read and clear 
Read and set 
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Table 30 Required PALmode Instructions 



Mnemonic Operation 



HALT Halt processor 

I M B I instruction stream memory barrier 



Table 31 Architecturally Reserved PALmode Instructions 



Mnemonic Operation 



H W_MTPR Move data to processor register 

H W_M F P R M ove data from processor regi ster 

H W_L D M ove data from memory 

H W_ST M ove data to memory 

HW_REI Return from PALmode exception 
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Technical Support and Ordering Information 

Technical Support 

If you need technical support or help deciding which literature best meets your 
needs, call the Digital Semiconductor Information Line: 

United States and Canada 1-800-332-2717 
Outside North America +1-508-628-4760 

Ordering Digital Semiconductor Products 

To order the Alpha 21066 or Alpha 21066A microprocessors, contact your local 
distributor. 

You can order the following semiconductor products from Digital: 

Product Order Number 

21066-166 microprocessor 21066- AA 

21066A-233 microprocessor 21066-AB 

21066A- 100 microprocessor 21066-CB 

21066A- 266 microprocessor 21066-DB 



Ordering Associated Literature 

The following table lists some of the available Digital Semiconductor literature. 
For a complete list, contact the Digital Semiconductor Information Line. 

Title Order Number 

Alpha Architecture Reference Manual 1 EY-L520E-DP-YCH 

^o order and purchase the Alpha Architecture Reference Manual, call 1-800-DIGITAL from 
the U.S. or Canada, or contact your local Digital office, or technical or reference bookstore where 
Digital Press books are distributed by Prentice Hall. 



Ordering Third-Party Literature 

You can order the following third-party literature directly from the vendor. 



Title 



Vendor 



PCI Local Bus Specification, 
Revision 2.0 



IEEE Standard for Binary 
Floating-Point Arithmetic 
(ANSI /IEEE Standard 754-1985) 



PCI Special Interest Group 
1-800-433-5177 (U.S.) 
1-503-797-4207 (International) 
1-503-234-6762 (FAX) 

IEEE Service Center 
445 Hoes Lane 
P.O. Box 1331 
Piscataway, NJ 08855-1331 
1-800-678-1 EEE (U.S. and Canada) 
908-562-3805 (Outside U.S. and Canada) 



